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System and Method for Registering Multiple Images 
with Three-Dimensional Objects 

5 Field of the Invention 

The invention relates generally to rendering graphics images, and more particularly 
to registering multiple images with three-dimensional objects. 

1 0 Background of the Invention 

:, T''. 

Computer graphics systems can acquire or synthesize images of real or imaginary 
5 objects and scenes, and then reproduce these in a virtual world. More recently, 
y computer systems have also attempted to do the reverse - to "insert" computer 
3L5 graphics images into the real world. Primarily, this is done indirectly for special 
w effects in movies, and for real-time augmented reality. Most recently, there is a 
f trend to use light projectors to render imagery directly in real physical 
N= environments. 

20 Despite the many advances in computer graphics, the computer has yet to replace 
the actual material experience of physical shape and spatial relationships. 
Designers, such as architects, urban planners, automotive engineers, artists and 
animators still resort to sculpting physical models before the design is fmahzed. 
One reason for this is that the human interface to a physical model is totally 

25 intuitive. There are no controls to manipulate, or displays to look through or wear. 
Instead, the model can be viewed from many perspectives while gazing generally 
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or focusing on interesting components, all at very high visual, spatial, and temporal 
fidelity. 

When an object or scene is illuminated by a neutral (white) hght, it is perceived 
5 according to the particular wavelengths of the hght reflected by its surface. 
Because the attributes of the surface are dependent only on the spectrum of the 
perceived hght, many attributes of objects can effectively be simulated by 
incorporating the object's attributes into the light source to achieve an equivalent 
effect on a neutral object. Thus, even non-reaUstic appearances can be visualized. 

10 

■'S A rotating movie camera has been used to acquire a film of a living room, replete 
If. with furniture, and people. The room and furniture were then painted a neutral 
O white, and the film was projected back onto the walls and fumimre using a rotating 
W projector that was precisely registered with the original camera, see Naimark, 
ij> "-Displacements" Exhibit at the San Francisco Museum of Modem Art, San 
K! Francisco, CA 1984. This crucial co-location of the acquiring camera and 
=5 displaying projector is common to most systems that use pre-recorded images, or 
M image sequences to illuminate physical objects. 

20 A projector and fiber-optic bundle have been used to animate the head of a 
fictional fortune teller inside a real crystal ball, see U.S. Patent No. 4,978,216 
"Figure with back projected image using fiber optics" Liljegren, et al., December 
18, 1990. Shdes of modified photographs augmented with fine details have been 
used with very bright projectors to render imagery on a very large architectural 

25 scale. A well known modem realization of this idea is Le Son et Lumiere on 
Chateau de Blois in the Loire Valley of France. In addition, this medium is now 
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being used elsewhere around the world to illuminate large scale structures such as 
bridges. 

All these systems render compelling visualizations. However, cumbersome 
5 alignment processes can take several hours, even for a single projector. The 

"Luminous Room" project treats a co-located camera-projector pair as an I/O bulb 
to sense and project imagery onto flat surfaces in the real physical surroundings of 
a room or a designated workspace, see Underkoffler et al. "'Emancipated pixels: 
Real-world graphics in the luminous room," SIGGRAPH '99, pp. 385-392, 1999. 
10 Their main focus is interaction with the information via luminous and tangible 
:i interfaces. They recognized co-planar 2D physical objects, tracked the 2D 
S positions and orientations in the plane, and projected light from overhead to 
y reproduce the appropriate sunlight shadows. 

15 In the "Facade" project, a sparse set of photographs was used to model and render 
P architectural monuments, see Debevec et al. '"Modeling and Rendering 
■f Architecture from Photographs," SIGGRAPH '96, August 1996. Their main 
P problems were related to occhision, sampling, and blending issues that arise when 
re-projecting images onto geometric models. They addressed these problems with 
20 computer images and analytic models. 

It would be useful to have a graphics system that can project images onto real 
three-dimensional objects or structures. It should also be possible to fit the images 
for any viewing orientation. In addition, it should be possible to change the 
25 appearance of illuminated objects at will. Finally, it should be possible to 

seamlessly fit images from multiple projectors onto a single complex object so that 
it can be viewed from any direction. 
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Summary of the Invention 

The invention provides a novel mode for visualizing three-dimensional computer 
5 graphics. The invention can synthesize not just textures, i.e., diffuse component of 
images, but also reproduce virtually any bi-directional reflectance distribution 
function (BRDF) using a radiance adjustment procedure. The invention also 
determines weighted pixel intensities across transitions of multiple overlapping 
images in the presence of depth discontinuities. 

10 

S The present paradigm for 3D computer graphics uses multiple projectors to 
fl graphically animate physical objects or structures in the real world. The concept is 
0 to replace a physical object having inherent color, texture, and material properties 
W with a neutral object illuminated by projected imagery, reproducing the original or 
1^ alternative appearance directly on the object. In this approach, the desired visual 
M properties of the object are effectively "lifted" into the projector. 

P Specifically, a computer implemented method registers an image with a 3D 

physical object by first acquiring a 3D graphics model of an object. Multiple 3D 

20 calibration points a surface of the object and corresponding 3D model calibration 
points in the 3D graphics model are identified. The object is illuminated with a 
calibration image using a projector at a fixed location. The calibration image is 
aligned with each of the 3D calibration points on the surface of the 3D physical 
object to identify corresponding 2D pixels in the calibration image, and then a 

25 transformation between the 2D calibration pixels and the corresponding 3D model 
calibration points is determined to register the projector with the 3D physical 
object. 
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Brief Description of the Drawings 

Figure 1 is a flow diagram of a system and method according to the invention; 

Figure 2 is a block diagram of a procedure for resolving multiple coordinate 
systems while rendering graphics images; 

Figures 3a-b are graphs of radiance at a point in a given direction; 

Figure 4 is a flow diagram of a registration method used by the invention; 

Figure 5 is a graph of prior art intensity correction; 

Figure 6 is a graph of cross-fading according to the invention; 

Figure 7 is a flow diagram of a method for correcting image intensities; 

Figure 8 is a block diagram of a procedure for correcting image intensities; 

Figure 9 is a flow diagram of a method for simulating motion of a static object in a 
scene. 



CR-1341 
Raskar et al. 

Detailed Description of the Preferred Embodiment 
Introduction 

5 Our invention combines three-dimensional (3D) computer graphics and animation 
with 3D physical objects and visual attributes of real scenes to allow a user to view 
physical objects and structures enhanced with virtual images. We treat the 
illumination of a real world item as a perspective projection of 3D images where 
images are aligned using computer vision techniques. We not only provide textures 

10 for the physical object, the background in the scene, that is the diffuse component 

5 of illumination, but we also can reproduce virmally any bi-directional reflectance 
distribution function (BRDF) to change the appearance of objects and the rest of 

5 the scene. In other words, our invention applies view-independem and view- 

W dependent visual effects to real world structures. 

|5 

ffl We describe a practical method for illuminating physical objects and scenes using 
+; a radiance adjustment procedure to guide the rendering process with correction of 
M corresponding image intensities. We also provide solutions for a complete 

illumination of physical objects by merging images from multiple projectors on a 
20 complex surface. We solve the problem of merged and fragmented overlaps in the 
presence of depth discontinuities so that the resultant reconstructed illumination is 
seamless. This process can also be used by image-based rendering (IBR) 
applications where multiple, depth-enhanced images from different views need to 
be merged to render novel views. 

25 

Our system can be used to illuminate objects such as tabletop items, industrial 
parts, clay models, building interiors, statues, architectural monuments, entire 
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buildings and bridges. We also provide animation, such as movement of virtual 
light sources and virtual rotation and linear motion of a static symmetrical physical 
objects. We also simulate motion of a static object and background by illuminating 
different parts of a physical scene with images of moving features. Our invention 
5 enable a "walk-through" of a hybrid room-sized virtual and real environment with 
tracked illumination. 

Our invention addresses a number of problems associated with illuminating 
physical objects. First is the problem of geometric registration where one or more 
1 0 projected images need to be exactly matched with a 3D physical object. Prior art 
S methods typically achieve this registration using extremely tedious and inaccurate 
*? manual methods. Below, we describe an automated registration method. 

W Second is the problem of changing the appearance of a physical object. For 

15 example, we provide techniques to make a matte object appear shiny, and a 

m stationary object to rotate. Third is the problem of completely illuminating a non- 

* : trivial physical objects in the presence of shadows and discontinuities due to self- 

P occlusion. Fourth is the problem of changing the appearance of a moving object. 

For example, we provide a solution for illuminating a moving object while 

20 changing the appearance of the object, and perhaps the background. 

System Overview 

Model Acquisition 

25 

As shown in Figure 1, a system 100 according to our invention begins by scanning 
a real world three-dimensional (3D) physical object or structure 101 in order to 
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acquire a 3D graphics model 1 1 1 of the object. For example, the 3D physical 
object 101 is a scaled-down wooden reproduction of the Taj Mahal. The Taj Mahal 
miniature is spray-painted a neutral (white) color. Its dimensions are approximately 
70 cm X 70 X 35 cm. 

5 

The 3D physical object 101 can be scanned, for example, by a 3D touch probe 
sensor that gives readings with an accuracy of 0.5 mm. The scanner 1 10 can also 
user other technique, such as sonar or laser ranging, stereo vision, 3D imaging, and 
the like, to acquire the exact geometry of the 3D graphics model 111 representing 
10 the 3D physical object 10 1 . 

The graphics model can be stored in a computer memory as a triangle mesh, point 
O cloud, or volumetric data. Other easy to render graphics data structures may also be 
W used to represent the 3D physical object 101 . In the preferred embodiment, the Taj 
15 Mahal model 1 1 1 is made up of 21,000 triangles. It should be noted that at this 
W point the model 1 1 1 is view independent and "undecorated." That is, the model is 
=P entirely specified by connected vertices and their orientations (normals). 

While the object is scanned, a set of 3D calibration points 1 12 are selected on the 
20 surface of the object, and the corresponding 3D model calibration pomts in the 3D 
graphics model are identified. The model calibration points are related to 
corresponding calibration pixels 114 in calibration images as described in greater 
detail below. For example, the calibration points 1 12 are easy to locate comers of 
the 3D physical object 101. The 3D calibration points and pixels are used by an 
25 optional automated registration step 400 described in greater detail below. This 
step "registers" a projector 160 with the object 101. The registration step 400 
determines extrinsic and intrinsic parameters 401 to be used while rendering 
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computer graphics images. The projection of a perspective illumination device has 
up to eleven degrees of freedom, six external and five internal. Therefore, a 
minimum of six fiducial points are required in order to resolve the extrinsic and 
intrinsic parameters that will allow us to exactly determine the necessary transform 
5 to align projected images of the model 1 1 1 with the 3D physical object 101 . To 
increase the accuracy of the calibration 400, we use at least twenty points. 

Model Editing 

10 During step 120, the 3D model 1 10 is enhanced or "decorated" with view- 
5 independent texture and view-dependent material characteristics to reflect a desired 
fi appearance of the object. This step can use any number of available computer 
5 graphics authoring tools. Typically, texture is defined by colored patterns. We use 
W fifteen texture maps. The material appearance characteristics determine how the 
15 object should appear when viewed from different angles. For example, a shiny or 

metallic object will appear differently than a dull object under different viewing 
-P and lighting arrangements. 

Normally, in the physical world, the texture and lighting associated with the 
20 surface of a physical object are an integral part of the object. We express these 
object characteristics using modified bi-directional reflectance distribution 
functions (BRDF), described in greater detail below. A number of hardware and 
software solutions are available to generate a highly detailed, virtual 3D graphics 
model 122. In an alternative embodiment, the model is interactively edited by 
25 applying a hand-held virtual "paint brush" tool directly to the 3D physical object 
101. The results of the user interaction can be stored in the computer memory for 
concurrent or later projection. 
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Rendering 

Step 130 renders an image 131 of the virtual 3D graphics model 122 using a 
5 rendering engine so that the appearance of the 3D physical object 101 can be 
modified in novel ways. In addition to the virtual 3D model 122, the rendering 
engine 130 also takes as input the extrinsic and extrinsic parameters 401 
determined during registration 400 and a location 132 of a user viewing the 3D 
physical object 101, i.e., in classical terms, the "camera" location. 

10 

5 For the purpose of correctly computing specular highUght effect, we can use, e.g., 

an Origin Instruments DynaSight™ optical tracking system 135 to track the 
5 location 132 of a moving user. The system 135 includes a compact, self-contained 
W stereo-sensor that measures the instantaneous three-dimensional position of a 
15 passive target. The sensor automatically acquires and tracks the user in an 
ffi uncontrolled environment. 

5 The input also inchides a location 133 of a virtual light that is used to illuminate 
the model 111. 

20 

Although the rendering view defined by the pose (location and orientation) of the 
projector remains fixed, the shading view is specified dependent on the user's 
location. In a traditional renderer, it is assumed that the viewer is directly behind or 
in front of the projector. That is, the viewer is aligned with the system's optical 
25 axis. This is not necessarily the case with our system 100. Therefore, the view 
setup for the rendering engine 130 needs to be modified to take account for a 
moving viewer. 

10 
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Figure 2 shows the steps of the procedure 200 that performs this setup. Essentially, 
the procedure 200 sets up separate transformation matrices for projector and 
shading parameters that are viewer dependent. The procedure is designed to 
5 operate with, for example, the OpenGL rendering programs. 

When we simulate motion of static objects, we project images of moving objects 
on the static objects. For example for a rotationally symmetric vase, we project 
10 images generated from a rotating model of the vase to give the illusion of virtual 
5 motion of the static object. We take this concept further when illuminating a scene 
■0 made of multiple objects. We project images corresponding to different but 
0 consistent motions on different parts of the scene. For example, to generate the 
W effect of a car moving on a road, we project an image where the portion 
15 illuminating the wheels rotate about the axle of the wheels, while the road under 
m the car or other portions of the back-drop move backwards. In addition, the motion 
£ of the wheels and the motion of the back-drop is consistent. As the wheels rotate 
5 faster, the backdrop translates at a higher rate giving the illusion that the static car 
is indeed moving faster. 

20 

Intensity Correction 

Step 140 corrects the intensity of the rendered image 131 on a per pixel basis to 
take into account for the orientation of the neutral surface of the 3D physical object 
25 101 , and the radiance for the oriented surface. The image intensities can be 

corrected using alpha-blending available in the graphics hardware. The details of 
this step are further described below. Standard rendering usually assume that the 
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image 131 is projected onto a single flat surface, e.g., a screen of a display device 
or a piece of paper, that is perpendicular to and centered on the optical axis. In our 
system 100, the image 131 can be projected onto a complex object or structure 
having any number of arbitrarily shaped surface oriented at various angles, 
5 including curved portions. This step produces a corrected image 141 . 

Cross-Fading 

As stated above, our invention can illuminate an entire object 101 or structure so 
10 that the viewer can perform a walk-around or walk-through to get a view of the 
l3 object from various angles. In this case, multiple images need to be projected. If 

multiple projectors 1 60 are used, then there are multiple instances of steps 400, 
1 130, 140, 150. Because the projected images can overlap, an additional corrective 
W step 150, described in greater detail below needs to be applied to obtain a final 
%5 projectable image 151. This step 150 feathers or blends the images in regions of 
» overiap. The feathered or blended unages 15 1 are supplied to the prqjector(s) 1 60, 
P as a last step, to illuminate the 3D physical object 101 . In one embodiment, we use 
P two digital projectors displaying pixels at a 1024x768 resolution. 

20 At this point, it should be noted that the projected images can vary dynamically to 
take into consideration varying viewer locations, object orientations, and virtual 
lightings. Alternatively, the object can be rotated to enable the user to view the 
object at various angles from a single location, or the images can make a stationary 
object appear as if it is rotating. The illuminated environment can also be a large 

25 scale structure in which the user is virtually immersed. In this case, the 

illumination can be done with one or more steerable laser beams instead of a 
projector. 

12 
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The entire method 100 can be summarized by the following pseudo code. 

During Pre-Processing 

Acquiring a 3D graphics model of a 3D physical object 
Approximately positioning a projector 

Determining a pose of the projector with respect to the physical object 
During Run-Time 

Defining a user location 

Editing the model to reflect a desired appearance of the object 

Modifying the edited model based on the user location 

Rendering the model as an image based on the pose and user location 

correcting image intensities for a surface orientation of the object 

Blending image intensities for overlap and occlusion 

Projecting the corrected and blended modified image on the object 

We will now describe the details of realistically treating the illuminating a physical 
object with virtual images, and the components and steps of the system according 
to the invention. 

Illumination and Appearance 

The appearance of a surface of an object or a structure is determined by the 
radiance at its surface. Hence, we generate any arbitrary appearance on a neutral 
surface by procedurally rearranging the incident radiance. In other words, the 
appearance on the neutral surface of the 3D physical object 101 is achieved by 
rendering an image for a given viewer location and warping the image during 
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intensity corrections 140. In order to change the appearance of the physical object, 
we rearranging the terms in the relationship between illumination and reflectance 
to reproduce an equivalent radiance at any surface point of the 3D physical object 
101. 

5 

Figure 3a shows the radiance in a certain direction at a point x 301, which has a 
given BRDF in the physical world, that can reach an observer 302. As shown in 
Figure 3b, this radiance can be mimicked by changing the BRDF and illuminating 
the point 302 with a appropriately located light source, e.g., a projector pixel 303. 
10 Below, we describe a process for adjusting the radiance and for determining the 
S necessary intensity of the projector pixels, given the position and orientation of the 
rj viewer and the virtual scene. 

W For a more systematic rendering, we mtroduce the notion of separating the 
15 rendering view (Figure 3a), i.e., the traditional virtual camera view, from the 
W shading view (Figure 3b), i.e., the position of the viewer with respect to the 3D 
=5 physical object 101 for lighting calculations. 

We begin with a basic rendering equation that approximates the geometries of the 
20 various optical paths. Such an equation is described by Kajiya in ''The Rendering 
Equation:' Computer Graphics 20(4), pp. 143-151, 1986. The radiance at the 
visible surface pomt x 301 in the dkection {9, (p) that reaches the observer 302 of a 
physical realization of the scene is 

L{x,d,(^) = g{x,eMLAx,0,^) + Hx,d,^)) , (i) 

25 where 

/i(x,^,<!)) = jF,(x,^,^,^,^,)L,(x,^,,^^,.)cos(^,)JfiJ, , (2) 

14 
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and g(x,0,</>) is a geometry term, i.e., visibility and distance, L^(x,d,^) is the 
emitted radiance at the point, non-zero only for light sources, and F^{x,d,(j),d,,(j).) 
is the BRDF of the point. The integral in hix,e,(f>) accounts for all reflection of 
incident radiance L.{x,6.,0.) from solid angles dco. . Radiance has dimensions of 
energy per unit time, area and solid angle. 

Treating each of the projected pixels as a point emitter, the radiance due to direct 
projector illumination at the same surface point at distance d(x), but with diffuse 
reflectance (x) is expressed by 

L\x,e,^)=g(x,d,^)K(x)i^{x,e^,(P^)cos(e^)/d(x)\ (3) 

where (x,d^,^^ ) is the radiant intensity of the projected pixel in the direction 
(Op,</>p), and is related to a discretized pixel value via filtering and tone 
representation. 

We can reproduce radiance L\x,d,(/)) equivalent to L(x,<9,^) , for the given viewer 
location, by solving equation (3) for the radiance intensity Ipi 

Thus, as long as the diffuse reflectance ku(x) is non-zero for all of the wavelengths 
represented in L{x,6,(p) , we can effectively represent the surface attributes with 
appropriate corrected pixel intensities. However, in practice, the range of values 
that can be displayed are limited by the brightness, dynamic range and pixel 
resolution of the projector. It should be noted that if the 3D physical object 101 is 
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illuminated by a steerable laser beam, the intensity of the beam can be varied 
accordingly while it scans across the object. 

The rendering 130 according to our invention involves two viewpoints: the user's 
viewpoint and the projector's viewpoint. A simple approach would first render the 
image as seen by the user, which is represented by L(x,d,0) as shown in Figure 
3a, and then use traditional image-based rendering techniques to warp this image to 
generate the intensity corrected projected image, represented by (x,e^ ) , see 
Chen et al., "View Interpolation from Image Synthesis," SGGRAPH '93, pp. 279- 
288, 1993, and McMillan et al. "Plenoptic Modeling," SIGGRAPH '95, pp. 39-46, 
1995. 

For a viewer location that is changing, view-dependent shading under static 
lighting conditions could also be implemented, see Levoy et al. "Light Field 
Rendering," SIGGRAPH '96, pp.31-42, 1996. 

However, we can avoid the warp because the display medium, i.e., the 3D physical 
object 101 is geometrically identical as the virtual 3D model 122. 

For a single-pass rendering, we treat the location 133 of the user as the shading 
view. Then, the image rendering process involves rendering the scene from the 
projector's view, by using a perspective projection matrix that matches the 
projector's intrinsic and extrinsic parameters 401, followed by radiance adjustment 
as described above. 
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The separation of these two views has some interesting side effects. For a static 
projector, the visibility and view-independent shading determinations need to be 
performed only once, even when the location of the user is changing. To realize a 
real-time interactive implementation, we use the 3D rendering engine 130 which 
5 approximates the general rendering equation (1). 

The BRDF process is divided into view-dependent specular, and view-independent 
diffuse and ambient components. View-independent shading determinations can be 
performed by assuming that the rendering and shading view are the same. We 
10 determine virtual shadows, also view-independent, by using a two-pass shadow- 
buffer technique. 

9 However, traditional rendering engines do not support specular highlights for 

W view-dependent shading for two separate views. Therefore, we perform the steps of 

15 the procedure 200 as shown in Figure 2. 

|i Secondary Scattering 

Because the 3D physical object 101 has a neutral surface with diffuse 
20 characteristics, secondary scattering is unavoidable. This can potentially affect the 
quality of the appearance of the illuminated object 101. When the virtual 3D model 
122 is purely diffuse, the secondary scattering can be used to our advantage. 

The geometric relationships, also known as form factors, among parts of the 
25 physical object 101 are naturally identical to same parts of the virtual 3D model 
122. Consider the radiosity solution for a patch i in a virtual scene with m light 
sources and n patches: 

17 
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^.-intended = K J^BjF,^ = K^B^F,^ + J^B^F, ^ \ (5) 

i V " / 



Here, the value kd is the diffuse reflectance, Bj is the radiance of patchy, and Fij is 
the form factor between patches. Using the system 100 to reproduce the effect of 
5 direct illumination, after radiance adjustment, we are able to generate the effect of 
m light sources: 

m 

However, due to secondary scattering, if the neutral surface have diffuse 
So reflectance k^., then the perceived radiance also includes the secondary scattering 
W due to the n patches, and that gives us 

::p 5j.actual = ^i-direct + ■Sj-secondary = ^d^^m^i.m ^u^^n^,,n ' 



The difference between the desired and perceived radiance is 

iK.-K)lB„F,, 



(8) 



Thus, in cases where kd and K are similar, we get approximate radiosity for "free." 
The projection of even a simple direct illumination rendering produces believable 
"spilling" of colors on neighboring parts of the physical objects. From the equation 
20 above, the secondary contribution from the neutral surface is certainly not accurate, 
even when we reproduce the first bounce exactly. The difference is even larger 
when the virtual object has non-Lambertian reflectance properties. 
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Illumination of All Visible Parts of the Surface of the Object 

For a complex physical object such as the Taj Mahal miniature 101, it is a problem 
to determine a good set of poses (location and orientation) for the projectors so that 

5 every visible surface is illuminated by at least one projector. This problem was 
partially addressed by Stuerzlmger '"Imaging all Visible Surfaces" Graphics 
Interface '99, pp. 115-122, 1999. Using a hierarchical visibility algorithm, he 
determined a set of camera viewpoints such that every visible part of every surface 
is illuminated by at least one projector. The problem of determining an optimal set 

10 of poses is NP-hard and is related to the art gallery problem described by 

9 O'Rourke "Art Gallery Theorems and Algorithms" Oxford University Press, New 

* York, 1987, known in the field of computational geometry. 

W Detailed System Components and Method Steps 

is 

m Image-based illumination of physical objects, in addition to the illumination 
+: problem described above, is still faced with several other problems. First is the 
U geometric registration problem. There is a need to match the projection of each 2D 

image 151 with the physical 3D object 101. This could be done manually. 
20 However, the projection of a perspective device has up to eleven degrees of 

freedom. Therefore, any effort to manually achieve the registration is likely to be 
extremely tedious and error prone. Below, we describe an automated technique for 
performing this registration. The second problem, for which there is no know 
solution, is the complete illumination of non-trivial physical objects in presence of 
25 overlap, shadows and self occlusion. 
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Using digital projectors or steerable lasers, and real-time 3D graphics rendering, 
we enable new approaches for image-based illumination. First, we determine a 3D 
geometric understanding of the geometry of the object 101, the projector(s) 160, 
the viewer 132, and the desired virtual lighting 133. We also provide a procedure 
5 that performs an important intensity correction step 140, and deal with occlusion, 
shadows and overlapping images in steps 150. 

Registration 

10 One of the important tasks in achieving a compelling visualization is to associate 
5 the graphical primitives of the virtual 3D model 122 with the physical object 101 
fl so that the appearance of the neutral object is enhanced. For example, we can 
Q specify any texture image that is to be displayed on the surface of the object with 
W the authoring tools 1 20. We can also alter the color distribution that improves the 
15 appearance of the object. We would also like to view the object under various 
ffif lighting conditions, real or imagined. 

5 As shown in Figure 4, we use a processor 401 to register 400 the 3D physical 
object 101 with the model 1 1 lobtained by the scanner 1 10 as follows. The 

20 processor 401 is coupled to all of the hardware components that comprise the 

system 100 of Figure 1. This processor can also include memory, and software and 
hardware of the editor and rendering engine. 

First, we approximately position each of the projectors 160-161, generally, 
25 illuminating device, so that projected images would substantially illuminate the 
physical object 101. Next, for each projector, we illuminate the 3D physical object 
101 with a calibration cross-hair 403. An input device 402 is used to align the 

20 
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projected cross-hair 403 with each of the selected 3D calibration points 112 of the 
physical object 101 in turn. This determines the coordinates of the corresponding 
2D calibration pixels 1 14, i.e., the pixel at the exact center of the cross-hair 402. 
We then adapt the geometric relationship between the 3D and 2D points to 

5 determine the projector's intrinsic and extrinsic parameters 401 from the rigid 
transformation between the coordinate systems of the object and the projectors. 
The transform is in the form a 3x4 perspective projection matrix up to scale, which 
is decomposed to find the intrinsic and the extrinsic parameters 401 of the 
projectors, see Faugeras "ThreeDimensional Computer Vision: A Geometric 

10 Viewpoints MIT Press, Cambridge, Massachusetts, 1993. The rendering step 130 

5 uses the same internal and external parameters 401 , so that the projected images 
151 are exactly registered with the physical object 101, even if the projectors are 

5 approximately positions with respect to the 3D physical object 101 . 

1 5 Intensity Correction 

■p The intensity of each rendered image 1 3 1 is corrected 140 on a per pixel basis, for 

P each projector, to take into account the reflectance of the neutral surface of the 3D 
physical object 101, the local orientation and distance with respect to the projector 

20 using equation (4). Because the surface normals used to determine the l/cos(9p) 
correction are available only at the vertices in a polygonal graphics model, we 
exploit the rendering engine 130 for approximate interpolation. We illuminate a 
white diffuse version of the 3D model 1 1 1, or a model matching appropriate non- 
zero diffuse reflectance Kix) of the physical model 101, with a virtual white light 

25 placed at the location of the projector, and correct the intensity for inverse squared 
distance attenuation. 
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The resultant intensities of the pixels vary smoothly across curved surface due to 

d(xy 

shading interpolation and inversely proportional to a factor — — — — . If the 

illumination device 160 has a limited dynamic range, we only illuminate portions 
of the surface with and angle Op < 60 because at greater angles l/cos(0) ranges 

5 from two to infinity. This avoids a low sampling rate of the projected pixels on 
severely obUque portion of the surface. This also minimizes mis-registration 
artifacts due to any errors in the geometric registration 400. During the calculations 
to locate overlap regions, described below, severely oblique portions of the surface 
are considered not to be illuminated by any projector, i.e., these are in the 

i) "shadow." 

O Correcting Occlusions and Overlaps by Cross-Fading 

1. For a complete illumination of the 3D physical object 101 , we use more than one 
16 projector. This leads to the difficult problem of seamlessly merging images from 
+; multiple projectors. A naive solution would simply illuminate any given surface 
E patch by only one given projector. However, there are two main problems when 

dealing with overlapping projectors. The fnst problem is the lack of color 

equivalence between neighboring projectors due to manufacturing process and 
20 temperamre color drift during their use, see Majumder et al. '"Color Calibration of 

Projectors for Large Tiled Displays" Proceedings of IEEE VisuaUzation 2000, 

IEEE Science Press, 2000. 

The second problem is our desire to minimize the sensitivity to small errors in the 
25 estimated geometric calibration parameters, or mechanical variations. Therefore, 
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we use feathering or cross-fading, also known as blending, to adjust the intensities 
of pixels in regions where images overlap. 

Cross-fading has been used in the past to generate seamless panoramic photo- 
5 mosaics by combining several views from a single location, SzeUski et al., 

"Creating Full View Panoramic Mosaics and Environment Maps," SIGGRAPH 97 
Conference Proceedings, pp. 251-258, 1997. Similar techniques are known in 
multi-projector wide-field-of-view displays, see Raskar et al. ''Multi-Projector 
Displays Using Camera-Based Registration" Proceedings of IEEE Visualization 
10 '99, 1999, and two-dimensional arrays of flat projections. In the prior art systems, 
5 the overlap region is typically a well-defined contiguous region on a planar display 
surface, as well as in each projector's frame buffer. 

y Figure 5 shows a profile of prior art cross-fading. In this case, projectors 501-502 

|5 illuminate a single continuous flat surface 500 with overlapping source images to 

W generate a single target image. Known correction methods typically simply weight 

+: the intensities 503-504 of the corresponding pixels proportional to their Euclidean 

P distance to a nearest boundary 505, or non-contribution pixel 506 of an adjacent 
image. The weights, in the range [0, 1], are multiplied by the pixel intensities to 

20 determine the final target image. 

The pixels weights near the boundary of a source image are near zero so that these 
pixels contribute very little to the target image, and there is a smooth transition to 
the adjacent image. Under ideal conditions, and assuming color equivalence, the 
25 weight contribution of all pixels of all images in the target image adds up to one. 
Even when the color response of one projector is different than any of the other 
projectors, the target image usually transitions smoothly in the region of overlap 
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507. This weight assignment strategy works well only when the source images 
illuminate a continuous planar surface at and around the region of overlap 507. 

In our case, as shown in Figure 6, the surface 600 of the physical object 101 
5 illuminated by the projectors 601-602 can be constructed of variously shaped parts, 
or the scene can include multiple different objects, a collection of disjoint objects 
resulting in shadows and fragmented overlap regions. More important for complex 
objects, overlap regions can containing surfaces with depth discontinuities 
resulting in occlusions. Now, with unequal color response, the resultant intensity 
10 distribution has offending sharp changes, e.g., at points 603-604. 



This problem is similar to the problems faced in image-based rendering (IBR), 
5 where warped, depth-enhanced images generate dis-occlusion artifacts. When 
* multiple source images are warped to the target image, the color assigned to a 
15 particular pixel needs to be derived, either from a single target image where the 
m pixel values overwrite each other, or as a weighted combination of corresponding 

pixels from source images. 

Feathering, which actually blurs the result, is usually necessary to overcome minor 
20 color difference in corresponding pixels in source images, and to hide ghosting 
effects due to small mis-registration errors. 

One of the few solutions to this problem is described by Debevec et al. '"Efficient 
View-Dependent Image-Based Rendering with Projective Texture -Mapping" Proc. 
25 of 9th Eurographics Workshop on Rendering, 1998. They scale the intensities by 
weights proportional to the angles between the source images and the target image. 
However, this does not guarantee that the weights will transition smoothly across 
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the surface of the scene. As a result, seams can appear in the target image where 
neighboring polygons are rendered with very different combinations of images. 
The plots 610 show the sample weighting scheme based on their solution and the 
corresponding problems. 

5 

Our method reUes on the following three constraints. First, the sum of the intensity 
weights of all of the corresponding source pixels should be one in the target image 
so that the intensities are normalized. Second, the weights of source pixels of a 
particular projector along a physical surface change smoothly in and near regions 
10 of overlap so that the inter-projector color differences do not have visible 
S discontinuity in the target unage. Third, the distribution of intensity weights of the 
S pixels of a particular projector within its frame-buffer is smooth so that small 
9 errors in calibration, or mechanical variations do not result in sharp edges. 

n 

1^5 In practice, it is easier to achieve and maintain precise geometric calibration than 
15 to ensure color equality among a set of projectors over a period of time. This 

Assess 

f makes the second constraint condition more important than the third. However, it 
M= is not always possible to satisfy the second and third constraints, e.g., when an 

occluded part moves closer to the display surface at 605. Hence, these constraints 
20 suggest solving the feathering problem, without violating the weight constraints at 

depth discontinuities and shadow boundaries. 

Instead of using the distance to the nearest boundary pixel to find the pixel 
weights, as in the prior art, our method 700, as shown in Figure 7 proceeds as 
25 follows. First, we locate 7 10 a first set of pixels corresponding to regions 
illuminated by a single projector. We assign 720 each of these first pixels an 
intensity weight of one. Then, for each remaining pixel (in a second set), we next 
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locate 730 a nearest pixel having a weight of one. To measure distances, we 
compute Euclidean (straight-line) distances. In step 730, we ignore paths that cross 
depth discontinuities. In step 740, we assign a weight to each second pixel that is 
inversely proportional to the distance to the nearest eligible pixel with a weight of 
5 one. 

Even under different color response, our method generates smooth transitions on 
planar surface in the presence of shadows and fragmented overlaps as shown by 
plot 620 of Figure 6. Our method can also be used for more than two projectors, 

10 and for blending pixels in regions of overlap and occlusion resulting from image- 

:|j based rendering systems. 

i In a practical implementation, we use two buffers, a depth buffer and an overlap 
S buffer. The depth buffer is updated with weighted pixel intensities while rendering 
;a^5 1 30 the virtual 3D model 1 22. The overlap buffer contains an integer value for 
M each pixel in the target image. The integer value indicates the number projectors 
* that contribute source pixels that that target pixel. Weights for pixels in regions of 
P overlap, i.e., pixels having corresponding integer values greater than one in the 

overlap buffer, use traditional shadow-buffer techniques. The pseudo code 800 for 
20 the method 700 is shown m Figure 8. 

For some pixels in the overlap region, it may not be possible to locate a nearest 
pixel for which an overlap count of one can be found. In this case, the shortest 
distance is assumed to be a large value. This substantially reduces the weight in 
25 isolated regions and also cuts down unnecessary transition zones. 
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Simulating Motion of a Static object in a Scene 

Figure 9 shows the steps of a method 900 for simulating motion of a static object 
in a static scene. First we generate a 3D model 91 1 of the object and the scene 901, 
5 for example a neutral toy car statically positioned on a white back-drop. Then, we 
register 920 the projector 960 with the object and the model as described above. 
User and lighting locations can also be provided. 

Next, the 3D model 91 1 is segmented 930 to indicate the different parts that move 
10 and animate separately with different degrees of freedom. For example, in the 
■Q segmented 3D model 93 1, the wheels, are segmented from the car body, and the 
J Ji entire car is segmented leaving a back-drop portion as a separate portion. It should 
% be understood that the number of segments is entirely dependent on a particular 
apphcation. 

a5 

The segmented 3D model 931 is edited 940 to create a virtual segmented model 
j: 941 . The motion of the various segment can independently consider rotation, 
N= translation, and other effects to provide the appropriate illusionary movement. 

For example, motion blur and moving shadows can be added, reflections and other 
20 global visual effects can be added as necessary 

If required, a consistent relationship can be enforced between the motion of 
different parts. For example, the clock- wise rotation of the wheels is consistent 
with the backward motion of the road- surface on the back-drop to give the 
25 impression that the car is speeding forwards. The virtual 3D model is then rendered 
corrected, and projected in real-time onto the object and scene 901. 
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Applications 

In a simplest form, the system 100 can be used to dynamically change the 
appearance of day-to-day objects and structures, or to add temporary markings on 
5 them. For example, engineers can mark areas of interest, such as drilling locations, 
without affecting the physical surface. Urban planners can visualize city models 
under various lighting condition. For stage and outdoor shows, the system can 
change backdrops as well as objects and props in scenes. Sculptors can illuminate 
clay models to simulate various textures and finishing materials before the design 
10 is finalized. Image-based illumination can also be used in the movie industry to 
D enhance details on miniature sets. The system can also be used with interior and 
jfl exterior architectural models. Instead of building exact detailed physical replicas, 
O the designer can use simple Styxofbam components, such as boxes, spheres and 
W cylinders, that are virtually decorated. 
15 

00 We have described a new mode for the visualization of 3D computer graphics, 
j; which involves light projectors and physical objects to generate rich detailed 

images directly in the user's world to enable new ways of interacting with 

synthetic imagery. We have presented techniques that make image-based 
20 illumination of non-trivial objects practical. Our invented visualization system 

method is compelling for a variety of applications including training, architectural 

design, art and entertainment. 
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